Design and Implementation of OGSA-DAI-RDF
نویسنده
چکیده
This paper presents the OGSA-DAI-RDF middleware that extends OGSA-DAI access to RDF database s ystems, e.g., Sesame and Jena. Several OGSA-DAI activities for handling RDF data and ontology are imp lemented. The query language interface is based on SPARQL query language. Introduction The National Institute of Advanced Science and Technology (AIST) of Japan started a 5-year project called AIST-SOA (Service Oriented Architecture). One aim of the project is to develop service-based middleware that utilizes Grid technology and Semantic Web technology. To support the AIST-SOA data platform, RDF (Resource Description Framework), which is a common metadata representation format used in semantic web applications, is very important. For example, we have conducted the following RDF-related grid research as a part of AIST-SOA. 1. Distributed RDF processing that provides scalable RDF storage: One approach is to use P2P technology like a DHT (distributed hash table) and support RDF query processing like SPARQL and RDQL. 2. High-level resource management that uses Semantic Web technology: For example, Semantic MDS supports the ontology for grid resource management and discovery. This system uses OWL-S and RDF storage as the resource management database. Therefore, it is very important to provide a service-based interface for RDF databases. To develop the database middleware, we already have several OGSA-DAI based projects, such as OGSA-WebDB; it is a very natural process to extend the OGSA-DAI interface to support RDF databases. 2. OGSA-DAI-RDF 2.1 RDF-handling Activities OGSA-DAI provides a framework called an Activity for extending the system. There are several kinds of Activities for relational databases, XML databases, data transport, and data conversion. Using this framework, we implemented a family of RDF-handling Activities, including the following examples: 1) SPARQL QueryStatement Activity 2) RDF Bulkload Activity 3) RDF ResourceManagement Activity 4) RDF CollectionManagement Activity For instance, the SPARQL QueryStatement Activity receives the W3C SPARQL representation and gets the result in RDF XML format. W3C also discusses the SPARQL protocol, and the message format used for OGSA-DAI-RDF is designed to be similar to those specifications. An example PerformDocument for SPARQL is shown in Fig.1. ResourceManagement Activity provides create and delete functions to manipulate RDF statements. CollectionManagement Activitiy supports create, delete and list functions to RDF data repository(Model for Jena). Our current implementation supports Jena and Sesame. Activities are designed to have as much common interface as possible between these products. Fig.2 shows the data service instance which has Sesame and Jena resources. (Fig.1. Perform Document for SPARQL) (Fig. 2. Extended Databrowser to support SPARQL) Using these activities, it is possible to manipulate individual RDF statements/graphs. 2.2 Ontology Handling Activity We also implemented several activities to handle ontology data. These activities are based on the reasoning function of Jena and supports OWL and RDFS. Ontology can be specified as an option of SPARQL QueryStatement Activity. It supports activity so that it is possible to pass ontology data from other activities. 2.2 Configuration File Support In order to extend the OGSA-DAI, we developed a new data resource type called RDF. The extended configuration file of OGSA-DAI (data.service.resource.properties) supports the following examples. (Fig. 3. Part of the configuration File for RDF databases) In this configuration, users can specify the RDF storage product. For Jena, it is possible to make RDF repository both on the RDB and on main memory. For Sesame, Sesame has its own tomcat-based http interface; however, we access Sesame directly, so that there is no other tomcat container. 2.3 Distributed RDF Processing based on Activity Programming Based on the architecture of OGSA-DAI, it is possible to combine other data-processing activities with RDF processing. For example, it is possible to convert the SPARQL result with the XSLTransform activity. Fig.4. shows the workflow of activities. We are implementing a simple distributed processing service on top of this software. This is also done with the Activity programming of OGSA-DAI framework. 1) Create a RDF resource and inserts data from other activity. 2) Query is issued to the resource using the ontology data which is also delivered from other activity (Fig.4.1 Activity workflow example) (Fig.4.2. Simple RDF Distributed Processing Example) 2.4 Extended Data Browser OGSA-DAI’s data browser is extended to support RDF-type data resources. The browser can also supports other resource type and if one selects resources with RDF type, the query interface will change to submit a SPARQL Query. The resulting format of the query is represented in tree/folder format as shown in Fig.5. (Fig. 5. Folder view of XML of a SPARQL-result format) 3. Status and Future Directions The first version of OGSA-DAI-RDF has been implemented on top of OGSA-DAI 2.1, and 1Q will be released in 2006. Online demonstration will be presented at the workshop. We are aiming to make a reference implementation for the coming RDF realization of GGF DAIS standard specifications.
منابع مشابه
OGSA-DAI 3.0 – The Whats and the Whys
OGSA-DAI provides an extensible framework that allows data resources to be incorporated into Grid fabrics. The current OGSA-DAI release, version 3.0, is a complete top-to-bottom redesign and implementation of the OGSA-DAI product. A number of fundamental conceptual and design changes are introduced in this release. In this paper we describe the motivation behind this redesign and provide an ove...
متن کاملThe design and implementation of Grid database services in OGSA-DAI
This paper presents a high-level overview of the design and implementation of the core components of the OGSA-DAI project. It describes the design decisions made, the project’s interaction with the Data Access and Integration Working Group of the Global Grid Forum and provides an overview of implementation characteristics. Further details of the implementation are provided in the extensive docu...
متن کاملA Grid-Enabled Digital Library System for Natural Disaster Metadata
The need to organize and publish metadata about European research results in the field of natural disasters has been met with the help of two innovative technologies: the Open Grid Service Architecture (OGSA) and the Resource Description Framework (RDF). OGSA provides a common platform for sharing distributed metadata securely. RDF facilitates the creation and exchange of metadata. In this pape...
متن کاملBenchmarking the OGSA-DAI Middleware
One important contribution to the community that is developing Grid middleware is the definition and implementation of benchmarks and tools to assess the performance and dependability of Grid applications and the corresponding middleware. In this paper, we present an experimental study that was conducted with OGSA-DAI, a popular package of middleware that provides access to remote data resource...
متن کاملProtecting Application Developers – A Client Toolkit for OGSA-DAI
The OGSA-DAI project has developed a client toolkit for the OGSA-DAI middleware. This toolkit comprises of a set of high-level APIs intended to protect application developers from changes in Grid specifications that impact upon OGSA-DAI and remove the requirement to manipulate XML documents either by hand or programmatically. In doing so, the client toolkit aims to improve the usability and sho...
متن کامل